Ultimate Pólya Gamma Samplers – Efficient MCMC for possibly imbalanced binary and categorical data
نویسندگان
چکیده
Modeling binary and categorical data is one of the most commonly encountered tasks applied statisticians econometricians. While Bayesian methods in this context have been available for decades now, they often require a high level familiarity with statistics or suffer from issues such as low sampling efficiency. To contribute to accessibility models data, we introduce novel latent variable representations based on Pólya-Gamma random variables range logistic regression models. From these representations, new Gibbs algorithms binary, binomial, multinomial logit are derived. All allow conditionally Gaussian likelihood representation, rendering extensions more complex modeling frameworks state space straightforward. However, efficiency may still be an issue augmentation estimation frameworks. counteract this, marginal strategies developed discussed detail. The merits our approach illustrated through extensive simulations real applications.
منابع مشابه
Boltzmann Samplers, Pólya Theory, and Cycle Pointing
We introduce a general method to count unlabeled combinatorial structures and to efficiently generate them at random. The approach is based on pointing unlabeled structures in an “unbiased” way that a structure of size n gives rise to n pointed structures. We extend Pólya theory to the corresponding pointing operator, and present a random sampling framework based on both the principles of Boltz...
متن کاملAdaptive Gibbs samplers and related MCMC methods
We consider various versions of adaptive Gibbs and Metropoliswithin-Gibbs samplers, which update their selection probabilities (and perhaps also their proposal distributions) on the fly during a run, by learning as they go in an attempt to optimise the algorithm. We present a cautionary example of how even a simple-seeming adaptive Gibbs sampler may fail to converge. We then present various pos...
متن کاملOn Mining Fuzzy Classification Rules for Imbalanced Data
Fuzzy rule-based classification system (FRBCS) is a popular machine learning technique for classification purposes. One of the major issues when applying it on imbalanced data sets is its biased to the majority class, such that, it performs poorly in respect to the minority class. However many cases the minority classes are more important than the majority ones. In this paper, we have extended ...
متن کاملGradient-based MCMC samplers for dynamic causal modelling
In this technical note, we derive two MCMC (Markov chain Monte Carlo) samplers for dynamic causal models (DCMs). Specifically, we use (a) Hamiltonian MCMC (HMC-E) where sampling is simulated using Hamilton's equation of motion and (b) Langevin Monte Carlo algorithm (LMC-R and LMC-E) that simulates the Langevin diffusion of samples using gradients either on a Euclidean (E) or on a Riemannian (R)...
متن کاملOn Mining Fuzzy Classification Rules for Imbalanced Data
Fuzzy rule-based classification system (FRBCS) is a popular machine learning technique for classification purposes. One of the major issues when applying it on imbalanced data sets is its biased to the majority class, such that, it performs poorly in respect to the minority class. However many cases the minority classes are more important than the majority ones. In this paper, we have extended ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of the American Statistical Association
سال: 2023
ISSN: ['0162-1459', '1537-274X', '2326-6228', '1522-5445']
DOI: https://doi.org/10.1080/01621459.2023.2259030